Search for: All records

Creators/Authors contains: "Ting, Yuan-Sen"


Achieving GPT-4o level performance in astronomy with a specialized 8B-parameter large language model

https://doi.org/10.1038/s41598-025-97131-y

de Haan, Tijmen; Ting, Yuan-Sen; Ghosal, Tirthankar; Nguyen, Tuan Dung; Accomazzi, Alberto; Wells, Azton; Ramachandra, Nesar; Pan, Rui; Sun, Zechang (December 2025, Scientific Reports)

AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for research in astronomy, astrophysics, cosmology, and astronomical instrumentation. Trained on the complete collection of astronomy-related arXiv papers from 2007 to 2024 along with millions of synthetically-generated question-answer pairs and other astronomical literature, AstroSage-Llama-3.1-8B demonstrates remarkable proficiency on a wide range of questions. AstroSage-Llama-3.1-8B scores 80.9% on the AstroMLab-1 benchmark, greatly outperforming all models (proprietary and open-weight) in the 8-billion parameter class, and performing on par with GPT-4o. This achievement demonstrates the potential of domain specialization in AI, suggesting that focused training can yield capabilities exceeding those of much larger, general-purpose models. AstroSage-Llama-3.1-8B is freely available, enabling widespread access to advanced AI capabilities for astronomical education and research.
Full Text Available
Scaling Laws for Emulation of Stellar Spectra

https://doi.org/10.33232/001c.140607

Różański, Tomasz; Ting, Yuan-Sen (January 2025, The Open Journal of Astrophysics)

Neural network-based emulators for the inference of stellar parameters and elemental abundances represent an increasingly popular methodology in modern spectroscopic surveys. However, these approaches are often constrained by their emulation precision and domain transfer capabilities. Greater generalizability has previously been achieved only with significantly larger model architectures, as demonstrated by Transformer-based models in natural language processing. This observation aligns with neural scaling laws, where model performance predictably improves with increased model size, computational resources allocated to model training, and training data volume. In this study, we demonstrate that these scaling laws also apply to Transformer-based spectral emulators in astronomy. Building upon our previous work with TransformerPayne and incorporating Maximum Update Parametrization techniques from natural language models, we provide training guidelines for scaling models to achieve optimal performance. Our results show that within the explored parameter space, clear scaling relationships emerge. These findings suggest that optimal computational resource allocation requires balanced scaling. Specifically, given a tenfold increase in training compute, achieving an optimal seven-fold reduction in mean squared error necessitates an approximately 2.5-fold increase in dataset size and a 3.8-fold increase in model size. This study establishes a foundation for developing spectral foundational models with enhanced domain transfer capabilities.
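
The scaling factors quoted in this abstract can be turned into power-law exponents. The following is a minimal sketch, assuming each quantity follows a pure power law in training compute (the abstract only reports clear scaling "within the explored parameter space"); the function and variable names are illustrative and are not taken from the paper.

```python
import math

# Factors quoted in the abstract for a tenfold increase in training compute.
COMPUTE_FACTOR = 10.0
MSE_REDUCTION = 7.0    # mean squared error drops about sevenfold
DATASET_FACTOR = 2.5   # dataset size grows about 2.5-fold
MODEL_FACTOR = 3.8     # model size grows about 3.8-fold


def power_law_exponent(factor, compute_factor=COMPUTE_FACTOR):
    """Exponent a such that quantity ~ compute**a, from one quoted ratio."""
    return math.log(factor) / math.log(compute_factor)


# Assumption: pure power laws, so a single ratio fixes each exponent.
alpha_mse = -power_law_exponent(MSE_REDUCTION)   # MSE decreases with compute
alpha_data = power_law_exponent(DATASET_FACTOR)
alpha_model = power_law_exponent(MODEL_FACTOR)


def scale(compute_ratio):
    """Extrapolated (mse_ratio, data_ratio, model_ratio) for a compute ratio."""
    return (compute_ratio ** alpha_mse,
            compute_ratio ** alpha_data,
            compute_ratio ** alpha_model)


if __name__ == "__main__":
    print(f"MSE exponent:     {alpha_mse:.3f}")
    print(f"dataset exponent: {alpha_data:.3f}")
    print(f"model exponent:   {alpha_model:.3f}")
```

Under these assumptions the dataset and model exponents sum to about 0.98, reflecting that 2.5 × 3.8 ≈ 9.5, close to the tenfold compute increase. Extrapolating beyond the range explored in the paper is not supported by the abstract.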
Full Text Available
Quantifying Bursty Star Formation in Dwarf Galaxies

https://doi.org/10.33232/001c.128414

Ting, Yuan-Sen; Ji, Alexander (January 2025, The Open Journal of Astrophysics)

Dwarf galaxy star formation histories are theoretically expected to be bursty, potentially leaving distinct imprints on their chemical evolution. We propose that episodic starbursts with quiescent periods longer than ~100 Myr should lead to discontinuous tracks in a dwarf galaxy’s [α/Fe]-[Fe/H] chemical abundance plane, with metallicity gaps as large as 0.3-0.5 dex at [Fe/H] = -2. This occurs due to continued Fe production by Type Ia supernovae during quiescent periods. We demonstrate that Gaussian mixture models can statistically distinguish discontinuous and continuous tracks based on the Akaike Information Criterion. Applying this method to APOGEE observations of the Sculptor dSph galaxy suggests an episodic star formation history with ~300 Myr quiescent periods. While current dwarf galaxy datasets are limited by small spectroscopic sample sizes, future surveys and extremely large telescopes will enable determining large numbers of precise chemical abundances, opening up the investigation of very short timescales in early dwarf galaxy formation. This unprecedentedly high time resolution of dwarf galaxy formation in the early Universe has important implications for understanding both reionization in the early Universe and the episodic star formation cycle of dwarf galaxies.
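
The model-selection idea in this abstract (Gaussian mixture models compared via the Akaike Information Criterion) can be illustrated on toy data. This is a simplified sketch, not the authors' pipeline: it works in one dimension on [Fe/H] only rather than in the [α/Fe]-[Fe/H] plane, uses a hand-written EM fit, and draws hypothetical mock stars with a 0.5 dex gap. All names and sample parameters are assumptions made for illustration.

```python
import math
import random


def _log_likelihood(xs, weights, means, variances):
    """Total log-likelihood of xs under a 1D Gaussian mixture."""
    total = 0.0
    for v in xs:
        total += math.log(sum(
            w * math.exp(-0.5 * (v - m) ** 2 / s) / math.sqrt(2 * math.pi * s)
            for w, m, s in zip(weights, means, variances)))
    return total


def gmm_aic_1d(x, k, n_iter=200):
    """Fit a k-component 1D Gaussian mixture by EM and return its AIC."""
    xs = sorted(x)
    n = len(xs)
    # Initialise means at evenly spaced quantiles, with the sample variance
    # shared by all components and equal weights.
    means = [xs[int((j + 0.5) * n / k)] for j in range(k)]
    mu = sum(xs) / n
    variances = [sum((v - mu) ** 2 for v in xs) / n] * k
    weights = [1.0 / k] * k
    for _ in range(n_iter):
        # E-step: responsibility of each component for each point.
        resp = []
        for v in xs:
            dens = [w * math.exp(-0.5 * (v - m) ** 2 / s)
                    / math.sqrt(2 * math.pi * s)
                    for w, m, s in zip(weights, means, variances)]
            norm = sum(dens)
            resp.append([d / norm for d in dens])
        # M-step: update weights, means, and variances (with a small floor).
        for j in range(k):
            nj = sum(r[j] for r in resp)
            weights[j] = nj / n
            means[j] = sum(r[j] * v for r, v in zip(resp, xs)) / nj
            var = sum(r[j] * (v - means[j]) ** 2
                      for r, v in zip(resp, xs)) / nj
            variances[j] = max(var, 1e-6)
    n_params = 3 * k - 1  # k means, k variances, k - 1 free weights
    return 2 * n_params - 2 * _log_likelihood(xs, weights, means, variances)


# Hypothetical mock sample: two bursts separated by a 0.5 dex gap in [Fe/H].
random.seed(0)
gapped = ([random.gauss(-2.4, 0.08) for _ in range(150)]
          + [random.gauss(-1.9, 0.08) for _ in range(150)])
aic_one = gmm_aic_1d(gapped, 1)
aic_two = gmm_aic_1d(gapped, 2)

if __name__ == "__main__":
    print(f"AIC, 1 component:  {aic_one:.1f}")
    print(f"AIC, 2 components: {aic_two:.1f}")
```

A lower AIC for the two-component fit favours a discontinuous (gapped) distribution in this toy setup. Real abundance data carry measurement uncertainties and smaller samples, so the separation would be less clear-cut than in this mock example.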
Full Text Available
Enhanced magnetic activity in rapidly rotating binary stars

https://doi.org/10.1038/s41550-025-02562-2

Yu, Jie; Gehan, Charlotte; Hekker, Saskia; Bazot, Michaël; Cameron, Robert H; Gaulme, Patrick; Bedding, Timothy R; Murphy, Simon J; Han, Zhanwen; Ting, Yuan-Sen; et al (May 2025, Nature Astronomy)

Full Text Available
Inferring Cosmological Parameters on SDSS via Domain-generalized Neural Networks and Light-cone Simulations

https://doi.org/10.3847/1538-4357/ad73d4

Lee, Jun-Young; Kim, Ji-hoon; Jung, Minyong; Oh, Boon Kiat; Jo, Yongseok; Park, Songyoun; Lee, Jaehyun; Ting, Yuan-Sen; Hwang, Ho Seong (October 2024, The Astrophysical Journal)

We present a proof-of-concept simulation-based inference on Ω_m and σ₈ from the Sloan Digital Sky Survey (SDSS) Baryon Oscillation Spectroscopic Survey (BOSS) LOWZ Northern Galactic Cap (NGC) catalog using neural networks and domain generalization techniques without the need for summary statistics. Using rapid light-cone simulations L-picola, mock galaxy catalogs are produced that fully incorporate the observational effects. The collection of galaxies is fed as input to a point cloud-based network, Minkowski-PointNet. We also add relatively more accurate Gadget mocks to obtain robust and generalizable neural networks. By explicitly learning the representations that reduce the discrepancies between the two different data sets via the semantic alignment loss term, we show that the latent space configuration aligns into a single plane in which the two cosmological parameters form clear axes. Consequently, during inference, the SDSS BOSS LOWZ NGC catalog maps onto the plane, demonstrating effective generalization and improving prediction accuracy compared to non-generalized models. Results from the ensemble of 25 independently trained machines find Ω_m = 0.339 ± 0.056 and σ₈ = 0.801 ± 0.061, inferred only from the distribution of galaxies in the light-cone slices without relying on any indirect summary statistics. A single machine that best adapts to the Gadget mocks yields a tighter prediction of Ω_m = 0.282 ± 0.014 and σ₈ = 0.786 ± 0.036. We emphasize that adaptation across multiple domains can enhance the robustness of the neural networks in observational data.
Full Text Available
C3PO – IV. Co-natal stars depleted in refractories are magnetically more active – possible imprints of planets

https://doi.org/10.1093/mnras/staf436

Yu, Jie; Ting, Yuan-Sen; Casagrande, Luca; Liu, Fan; Wang, Sharon X; Sun, Qinghui; Huber, Daniel; Chen, Boquan; Cordoni, Giacomo; Da Costa, Gary; et al (March 2025, Monthly Notices of the Royal Astronomical Society)

Chemical abundance anomalies in twin stars have recently been considered tell-tale signs of interactions between stars and planets. While such signals are prevalent, their nature remains a subject of debate. On the one hand, exoplanet formation may induce chemical depletion in host stars by locking up refractory elements. On the other hand, exoplanet engulfment can result in chemical enrichment, and both processes potentially produce similar differential signals. In this study, we aim to observationally disentangle these processes by using the Ca II infrared triplet to measure the magnetic activity of 125 co-moving star pairs with high signal-to-noise ratio, high-resolution spectra from the Magellan, Keck, and VLT (Very Large Telescope) telescopes. We find that co-natal star pairs in which the two stars exhibit significant chemical abundance differences also show differences in their magnetic activity, with stars depleted in refractories being magnetically more active. Furthermore, the strength of this correlation between differential chemical abundances and differential magnetic activity increases with condensation temperature. One possible explanation is that the chemical anomaly signature may be linked to planet formation, wherein refractory elements are locked into planets, and the host stars become more active due to more efficient contraction during the pre-main-sequence phase or star–planet tidal and magnetic interactions.
Full Text Available
LESSPayne: Labeling Echelle Spectra with SMHR and Payne

Ji, Alexander P; Casey, Andrew R; Ting, Yuan-Sen; Holmbeck, Erika M; Frebel, Anna; Usman, Sam A; Limberg, Guilherme; Shah, Shivani P; Chiti, Anirudh; Ezzeddine, Rana; et al (March 2025, Astrophysics Source Code Library)

Full Text Available
Extending the chemical reach of the H3 survey: detailed abundances of the dwarf-galaxy stellar stream Wukong/LMS-1

https://doi.org/10.1093/mnras/stae969

Limberg, Guilherme; Ji, Alexander P; Naidu, Rohan P; Chiti, Anirudh; Rossi, Silvia; Usman, Sam A; Ting, Yuan-Sen; Zaritsky, Dennis; Bonaca, Ana; Borbolato, Lais; et al (April 2024, Monthly Notices of the Royal Astronomical Society)

We present the first detailed chemical-abundance analysis of stars from the dwarf-galaxy stellar stream Wukong/LMS-1 covering a wide metallicity range ($-3.5 < \mathrm{[Fe/H]} \lesssim -1.3$). We find abundance patterns that are effectively indistinguishable from the bulk of Indus and Jhelum, a pair of smaller stellar streams proposed to be dynamically associated with Wukong/LMS-1. We confirmed a carbon-enhanced metal-poor star ($\mathrm{[C/Fe]} > +0.7$ and $\mathrm{[Fe/H]} \sim -2.9$) in Wukong/LMS-1 with strong enhancements in Sr, Y, and Zr, which is peculiar given its solar-level [Ba/Fe]. Wukong/LMS-1 stars have high abundances of α elements up to $\mathrm{[Fe/H]} \gtrsim -2$, which is expected for relatively massive dwarfs. Towards the high-metallicity end, Wukong/LMS-1 becomes α-poor, revealing that it probably experienced fairly standard chemical evolution. We identified a pair of N- and Na-rich stars in Wukong/LMS-1, reminiscent of multiple stellar populations in globular clusters. This indicates that this dwarf galaxy contained at least one globular cluster that was completely disrupted in addition to two intact ones previously known to be associated with Wukong/LMS-1, which is possibly connected to similar evidence found in Indus. From these ≥3 globular clusters, we estimate the total mass of Wukong/LMS-1 to be $\approx 10^{10}\,\mathrm{M}_\odot$, representing ∼1 per cent of the present-day Milky Way. Finally, the [Eu/Mg] ratio in Wukong/LMS-1 continuously increases with metallicity, making this the first example of a dwarf galaxy where the production of r-process elements is clearly dominated by delayed sources, presumably neutron-star mergers.
Full Text Available
Dwarf galaxy archaeology from chemical abundances and star-formation histories

https://doi.org/10.1093/mnras/stad2985

Johnson, James W; Conroy, Charlie; Johnson, Benjamin D; Peter, Annika H; Cargile, Phillip A; Bonaca, Ana; Naidu, Rohan P; Woody, Turner; Ting, Yuan-Sen; Han, Jiwon Jesse; et al (October 2023, Monthly Notices of the Royal Astronomical Society)

We model the stellar abundances and ages of two disrupted dwarf galaxies in the Milky Way stellar halo: Gaia-Sausage Enceladus (GSE) and Wukong/LMS-1. Using a statistically robust likelihood function, we fit one-zone models of galactic chemical evolution with exponential infall histories to both systems, deriving e-folding time-scales of $\tau_\text{in} = 1.01 \pm 0.13$ Gyr for GSE and $\tau_\text{in} = 3.08^{+3.19}_{-1.16}$ Gyr for Wukong/LMS-1. GSE formed stars for $\tau_\text{tot} = 5.40^{+0.32}_{-0.31}$ Gyr, sustaining star formation for ∼1.5–2 Gyr after its first infall into the Milky Way ∼10 Gyr ago. Our fit suggests that star formation lasted for $\tau_\text{tot} = 3.36^{+0.55}_{-0.47}$ Gyr in Wukong/LMS-1, though our sample does not contain any age measurements. The differences in evolutionary parameters between the two are qualitatively consistent with trends with stellar mass $M_\star$ predicted by simulations and semi-analytic models of galaxy formation. Our inferred values of the outflow mass-loading factor reasonably match $\eta \propto M_\star^{-1/3}$ as predicted by galactic wind models. Our fitting method is based only on Poisson sampling from an evolutionary track and requires no binning of the data. We demonstrate its accuracy by testing against mock data, showing that it accurately recovers the input model across a broad range of sample sizes ($20 \le N \le 2000$) and measurement uncertainties ($0.01 \le \sigma_{[\alpha/\mathrm{Fe}]}, \sigma_{[\mathrm{Fe/H}]} \le 0.5$; $0.02 \le \sigma_{\log_{10}(\text{age})} \le 1$). Due to the generic nature of our derivation, this likelihood function should be applicable to one-zone models of any parametrization and easily extensible to other astrophysical models which predict tracks in some observed space.
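
Two relations named in this abstract lend themselves to a short worked example: the exponential infall history and the mass-loading scaling with exponent -1/3. The sketch below is an illustration under stated assumptions, not the authors' model or likelihood: it assumes the infall rate is proportional to exp(-t/τ_in) starting at t = 0, assumes infall and star formation begin together, and uses only the central values quoted above without their uncertainties. Function names are hypothetical.

```python
import math


def accreted_fraction(t_gyr, tau_in_gyr):
    """Fraction of the total (t -> infinity) infall accreted by time t,
    assuming an infall rate proportional to exp(-t / tau_in)."""
    return 1.0 - math.exp(-t_gyr / tau_in_gyr)


def mass_loading_ratio(stellar_mass_ratio):
    """Ratio of mass-loading factors for two galaxies, assuming the scaling
    eta ~ M_star**(-1/3) mentioned in the abstract."""
    return stellar_mass_ratio ** (-1.0 / 3.0)


# Central values from the abstract (uncertainties ignored in this sketch).
gse_fraction = accreted_fraction(5.40, 1.01)      # GSE: tau_tot, tau_in
wukong_fraction = accreted_fraction(3.36, 3.08)   # Wukong/LMS-1

if __name__ == "__main__":
    print(f"GSE infall completed by end of star formation:    {gse_fraction:.3f}")
    print(f"Wukong infall completed by end of star formation: {wukong_fraction:.3f}")
    print(f"eta ratio for an 8x more massive galaxy: {mass_loading_ratio(8.0):.2f}")
```

With these central values, GSE would have accreted nearly all of its eventual infall by the time star formation ended, whereas Wukong/LMS-1 would have accreted roughly two-thirds. The large uncertainty on the Wukong/LMS-1 infall time-scale means the second figure is only indicative.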
Full Text Available
Carbon-enhanced metal-poor star candidates from BP/RP spectra in Gaia DR3

https://doi.org/10.1093/mnras/stad1675

Lucey, Madeline; Al Kharusi, Nariman; Hawkins, Keith; Ting, Yuan-Sen; Ramachandra, Nesar; Price-Whelan, Adrian M; Beers, Timothy C; Lee, Young Sun; Yoon, Jinmi (June 2023, Monthly Notices of the Royal Astronomical Society)

Carbon-enhanced metal-poor (CEMP) stars comprise almost a third of stars with [Fe/H] < −2, although their origins are still poorly understood. It is highly likely that one sub-class (CEMP-s stars) is tied to mass-transfer events in binary stars, while another sub-class (CEMP-no stars) is enriched by the nucleosynthetic yields of the first generations of stars. Previous studies of CEMP stars have primarily concentrated on the Galactic halo, but more recently they have also been detected in the thick disc and bulge components of the Milky Way. Gaia DR3 has provided an unprecedented sample of over 200 million low-resolution (R ≈ 50) spectra from the BP and RP photometers. Training on the CEMP catalogue from the SDSS/SEGUE database, we use XGBoost to identify the largest all-sky sample of CEMP candidate stars to date. In total, we find 58 872 CEMP star candidates, with an estimated contamination rate of 12 per cent. When comparing to literature high-resolution catalogues, we positively identify 60–68 per cent of the CEMP stars in the data, validating our results and indicating a high completeness rate. Our final catalogue of CEMP candidates spans from the inner to outer Milky Way, with distances as close as r ∼ 0.8 kpc from the Galactic centre, and as far as r > 30 kpc. Future higher resolution spectroscopic follow-up of these candidates will provide validations of their classification and enable investigations of the frequency of CEMP-s and CEMP-no stars throughout the Galaxy, to further constrain the nature of their progenitors.
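
The contamination and completeness figures quoted in this abstract can be related to classifier counts. This is a minimal sketch assuming the standard definitions (contamination as the false-positive share of selected candidates, completeness as the recovered share of known CEMP stars); the paper's exact definitions may differ, and the counts in the usage lines are hypothetical.

```python
def contamination(true_pos, false_pos):
    """Share of selected candidates that are not genuine members."""
    return false_pos / (true_pos + false_pos)


def completeness(true_pos, false_neg):
    """Share of genuine members that the classifier recovers."""
    return true_pos / (true_pos + false_neg)


# Figures quoted in the abstract.
N_CANDIDATES = 58872
CONTAMINATION_RATE = 0.12

# Expected number of genuine CEMP stars among the candidates, assuming the
# quoted contamination rate applies uniformly to the whole catalogue.
expected_genuine = N_CANDIDATES * (1.0 - CONTAMINATION_RATE)

if __name__ == "__main__":
    print(f"Expected genuine CEMP stars: {expected_genuine:.0f}")
    # Hypothetical cross-match: 64 of 100 known CEMP stars recovered.
    print(f"Completeness: {completeness(64, 36):.2f}")
    # Hypothetical validation set: 88 genuine and 12 spurious candidates.
    print(f"Contamination: {contamination(88, 12):.2f}")
```

Under the uniform-contamination assumption, roughly 51,800 of the 58 872 candidates would be genuine CEMP stars.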
Full Text Available
